Learning Rewrite Rules for Search Database Systems Using Query Logs

نویسندگان

  • Monu Kedia
  • Dinesh Garg
  • Sriram Raghavan
چکیده

Recent literature on “search database systems” has introduced the notion of using query rewrite rules to influence the behavior of a search engine. Rewrite rules enable domain experts and search administrators to customize the search engine by providing a powerful rule-driven framework to transform user search queries. In this paper, we address the important problem of automatically learning such query rewrite rules from query logs using a novel Hidden Markov Model (HMM) formulation. Our formulation captures both the latent information present in the sequence of queries issued within a session as well as the explicit information in the form of user clicks on individual results. We have developed the notion of support and confidence for query rewrite rules and leveraged these concepts to harvest high quality rewrite rules from the trained HMM. We propose rigorous evaluation schemes to quantify the efficacy of a set of rewrite rules and demonstrate the effectiveness of our approach through extensive experiments using the publicly available AOL query log data set and the YAGO concept ontology.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Relational Databases Query Optimization using Hybrid Evolutionary Algorithm

Optimizing the database queries is one of hard research problems. Exhaustive search techniques like dynamic programming is suitable for queries with a few relations, but by increasing the number of relations in query, much use of memory and processing is needed, and the use of these methods is not suitable, so we have to use random and evolutionary methods. The use of evolutionary methods, beca...

متن کامل

Ontology Based Query Processing in Database Management Systems

The use of semantic knowledge in its various forms has become an important aspect in managing data in database and information systems. In the form of integrity constraints, it has been used intensively in query optimization for some time. Similarly, data integration techniques have utilized semantic knowledge to handle heterogeneity for query processing on distributed information sources in a ...

متن کامل

Analysis of User query refinement behavior based on semantic features: user log analysis of Ganj database (IranDoc)

Background and Aim: Information systems cannot be well designed or developed without a clear understanding of needs of users, manner of their information seeking and evaluating. This research has been designed to analyze the Ganj (Iranian research institute of science and technology database) users’ query refinement behaviors via log analysis.    Methods: The method of this research is log anal...

متن کامل

Mining Low-Risk Rules for Altering Query Terms from Large-Scale Logs of Query Reformulations

A widely-used method that Web search engines use to improve relevance is automatically altering terms in the user‘s query, in order to overcome potential vocabulary mismatches between the query and relevant Web pages. In commercial search engines, a large percentage of all queries are altered in some way. While such query alteration has significant upside potential to improve relevance for many...

متن کامل

Towards Cost-based Query Optimization in Native XML Database Management Systems

In the last few years, XML became a de-facto standard for the exchange of structured and semi-structured data. The database research community took this development into account by proposing native XML database management systems for efficient and transactional management of XML documents. One of the most important factors for success of such systems is a powerful query optimizer. Many research...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014